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1 Introduction 



Constraint satisfaction problems (CSPs) models many important intractable 
A/'P-hard problems such as propositional satisfiability problem (SAT) [Tj. Al- 
gorithms with non-trivial upper bounds on running time for restricted SAT 
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with bounded clause length k (fc-SAT) can be classified into three styles: 
DPLL-like, PPSZ-like and Local Search [2], with local search algorithms hav- 
ing already been generalized to CSP with bounded constraint arity k (A;-CSP) 
[5]. We generalize a DPLL-like algorithm in its simplest form and a PPSZ- 
like algorithm [4\ from fc-SAT to A;-CSP. As far as we know, this is the first 
attempt to use PPSZ-like strategy to solve fc-CSP, and before little work has 
been focused on the DPLL-like or PPSZ-like strategies for /c-CSP. 

For the DPLL-like deterministic A;-CSP algorithm, a recurrent inequality is 
tightly solved to get a non-trivial upper bound 0*(((i— ^^)") on running time, 
where n is the number of variables and d is the domain size of variables in 
input. For the PPSZ-like randomized fc-CSP algorithm, the Satisfiability Cod- 
ing Lemma [3] is extended to non-Boolean case to show that with probability 
approaching 1, a satisfying assignment can be found in time 0*((d^^^)"'). 
O* indicates that some polynomial factor in n is ignored in big-0 natation. 

CSP generalizes SAT in two aspects: each variable can have more than two 
available values, and each constraint can have more than one falsifying partial 
assignments. These falsifying partial assignments to some constraint are called 
nogoods. For example, in graph 3-coloring problem a constraint of two variables 
X and y with domain {0,1,2} has tuples of values such as {x : 0,y : 0), 
{x : l,y : 1) and {x : 2,y : 2) as its nogoods. If the first k — 1 variables in a 
constraint with arity k have values in agreement with a nogood, then the last 
variable cannot have the value specified by the nogood, so as not to falsify this 
constraint. Such a variable is thus called narrowly chosen. Our generalizations 
rooted from the key observation that nogoods (instead of constraints) in CSP 
can be treated as clauses in SAT to produce narrowly chosen variables which 
can be exploited by algorithms to reduce their search efforts. 
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The remainder of the paper is organized as follows. Section 2 describes and 
analyzes the generalized DPLL-like deterministic /c-CSP algorithm. Section 
3 extends the original satisfiability coding lemma [4J to non-Boolean case. 
Section 4 presents the PPSZ-like randomized fc-CSP algorithm and its analysis. 

2 The DPLL-like Deterministic A;-CSP Algorithm 

Our DPLL-like algorithm for fc-CSP with variables domain size d works as 
follow: for any nogood {ui : ai, ...,Mfc : a/c), branch on Mi to (i — 1 branches, 
on each branch a value other than oi and also different from value assigned 
on other sister branches is assigned to Ui and then recursively go down the 
branch. If all these branch fails to find a satisfying assignment, then fix ui to 
value ai and branch on U2 in exactly the same way as on ui except that this 
time the number of remaining variables decreased by one. Denote the running 
time of this algorithm by T{n), then clearly for d > 2 and k >2: 

T{n) <{d- l)(r(n - 1) + ... + T(n - k)) + poly(n). (1) 

Note that as usual we can safely ignore the additive poly(n) term at right 
hand side and treat the inequality as an equation. 

When c? is a fixed constant, linear recursion (1) has solution T{n) = 0*(A") 
with A the maximum root in characteristic equation /(A) = A'^ — (d — 1)(A*''^^ + 
... -|- 1) = 0. Since A > 1, our trick is to find the maximum root in equation 
^(A) = (A - 1)/(A) = X^+^ - dX'' + {d- 1) = 0. g{X) is strictly increasing 
when A > d{l — -j^)- We can find that when X > d — g{X) > 0; when 
A = d-^, g{X) < 0. Hence the tight solution of (1) is T{n) = 0*((rf-^)"). 
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When d is not fixed and varies with n, specifically d = n°' with a a constant, 
this case models some practical problems (e.g. the Latin square problem and 
the iV-queen problem) and a random CSP model (called Model RB ), which 
contains many hard instances seemingly quite challenging for various kinds of 
algorithms, both theoretically ^ and experimentally [6], and a trivial upper 
bound is 0*(n""). Rewrite the recursion (1) as 

T{n) = (n" - l)(T(n - 1) + ... + T{n - k)). (2) 

When a < 1, for any fixed e > 0, for large n with — 1 > we have 
l]"=n_fe^(0 < ^T{n), so for large enough n (actually n + 1 will be fine for 
above n): T{n) < (n" - l){T{n - 1) + YJlZn-kT{i)) < (^° - l)(T(n - 1) + 
T.'IZn-k-iT{i)) < (r2°-l)(T(n-l) + eT(n-l)) < n"(l+e)T(r2-l). Substitute 
n by smaller numbers and combine these inequalities, we have for any fix 
number e > 0: T{n) = 0*((n!)"(l + e)") = 0*((f )°"(1 + e)"). 

When a > 1, there is some number (3 with 1 < (3 < a, such that for sufficiently 
large n, (n - 1)" - 1 > n^, so T{n) < (n" - l)(T(n - 1) + E7=n-kTii)) < 
K-l)m^-l) + Er=„VinO) = (n"-l)(T(n-l)+ (;^^T(n-l)) < 
?T,"(1 + -;^)T{n — 1). Since nj^i(l + converges to a finite number, by 
applying the same analysis as in above paragraph, we have T{n) = 0*((^)""). 

3 A Generalized Satisfiability Coding Lemma 

Abbreviation w.r.t. means with respect to. Our key generalization to a defini- 
tion in [4] about isolated points, critical point and critical variables is: 

Definition 1. For a A;-CSP instance F with domain D for its n variables. 
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call X = (ai, Oj, a„) an isolated point w.r.t. a set 5* C if there exists 
a dimension i G {l,2...,n} and an E D — {ai} such that X G 5 but 
(«!, a^, a^i) ^ 5'. Call such a dimension i a critical point of X w.r.t. 
5* and the variable Ui at dimension i a critical variable. 

We only require that i/iere exist a[ E D — {a.i\ such that X E S but X' = 
(ai, a^, a„) ^ S*, rather than that /or all a[ E D — {aj} (which can only 
work for SAT but not for CSP). This right choice (which works for both SAT 
and CSP) makes the following two generalized lemmas and the generalized 
algorithm with analysis in next section straightforward to follow the routine 
in as follows. 

Denote the number of critical points of X w.r.t. S by Js{X). Call X j-isolated 
w.r.t. S* if X is an isolated point in exactly j dimensions w.r.t. S. Call an 
n-isolated solution X an isolated solution. When S is the set of all solutions of 
F, we can omit the words w.r.t. S. When solution X = (ai, ...,aj, ...,a„) has 
a critical point i, there must be a constraint with a nogood in agreement with 
X except only in flipping to some a[ E D — {ai}. Call such a constraint 
critical. In any value assigning sequence of variables, if a critical variable Ui 
is assigned value last among all the variables in its critical constraint, and 
all other variables than Ui are assigned values in agreement with X, then 
the value a[ should not be assigned to Ui (otherwise the critical constraint 
will be falsified), thus the domain of Ui is narrowed. Call such a variable 
Ui narrowly chosen., otherwise fully chosen. For any given partial assignment 
and any constraint, we can efficiently check if a variable in this constrain is 
narrowly chosen: it is narrowly chosen iff other variables in this constrain has 
assigned values in agreement with a nogood for this constraint, and every 
constraint with arity k can have at most nogoods. 
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Lemma 1 Let F be a fc-CSP instance with a j'-isolated solution X. Then 
over all value assigning sequences of variables with the final value assignment 
X, the average number of narrowly chosen variables is at least j/k, thus the 
average number of fully chosen ones is at most n — j /k. 

Proof: (As in [Ij) For a random value assigning sequence a, since no constraint 
involves more than k variables in a /c-CSP instance, the probability that a 
critical variable is assigned last among all the variables in its critical constraint 
is at least 1/k. For each critical constraint, if the corresponding critical variable 
is last assigned, then this variable will be narrowly chosen. The j-isolated 
solution X has exactly j critical points and these j critical variables each has 
a critical constraint. Thus, the average number of narrowly chosen variables is 
at least j /k when X is the final assignment. With a total number of variables 
n, the average number of fully chosen variables is no more than n—j/k. Q.E.D. 

Lemma 2 If a nonempty set S C with \D\ = d, then Z^^esl^)"""^'*'^'* > 1- 

Proof: (By induction on as in [^.) Case n = is trivially true. For n > 0, 
consider a fixed dimension, say n. Assume D = {ai, a^} and divide the set 
S into d subsets Si, Sd, such that Si = S^ x {ai} with S[ the projection of 
Si to the first n — 1 dimensions. For any X in nonempty Si, denote the image 
of X in SI by X', then induction hypothesis says Z]xes'(^)" "^K^^^ ^ > 1- 
Since S is nonempty, some Sj is nonempty. For any X G Sj, dimension n is 
surely a critical point of X w.r.t. Sj, so Js^(X) = Js'^{X') + 1. On the other 
hand, dimension n is a critical point of X w.r.t. S iff some Si is empty. Say 
Si is empty, then dimension n is a critical point of X w.r.t. S, so Js{X) = 
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point of X w.r.t. so Js{X) = Js,{X) - 1. In this case ExesiiT'^^'^''^ = 
4 PPSZ-like Randomized A;-CSP Algorithm 

Our PPSZ-hke algorithm for A;-CSP and its analysis generalize from one for 
A;-SAT |1] with the key observation that we can use a partial assignment and 
nogoods to efficiently produce narrowly chosen variables w.r.t. some value 
assigning sequence of variables, as explained in introduction and last sections. 

Algorithm A 

repeat n{n + l)(c/^ {d — times 

while there exists an unassigned variable 

select an unassigned variable y at random 
if y is narrowly chosen 

then set y io a. random value in the narrowed domain 
else set y to & random value in its full domain 
if the CSP instance is satisfied, then output the assignment 

Now we prove that Algorithm A can find a solution to a satisfiable fc-CSP 
instance F in time with probability approaching 1. Suppose 

that X is an j-isolated solution of F with j critical points (1 < j < since 
j = is a trivial case of tautology input without any nogood). In one iteration 
of the repeat loop, by lemma 1, the average number of critical variables as- 
signed last among all the variables in its critical constraint is at least j /k, over 
the random value assigning sequences of variables in the while loop. Then by 
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Markov inequality (on complement event), the probability of the event that 
for at least j /k critical constraints, the critical variables occur last among the 
variables in the critical constraint, is at least — ^-tt- When this event occurs, 

' n—]/k+l ' 

the number of fully (narrowly) chosen variables is at most n — j/k (at least 
j/k), and each fully (narrowly) chosen variable's value has probability exact 
^ (at least ■^^) to agree with the corresponding value of X, so the probability 
of the event that the values assigned to the variables in while loop agree with 
the assignment X is at least (^)'^~-'^*^(^rj;)-'^*^ conditioned on the above event. 
Thus, the probabihty that a j-isolated solution X of F is output by algorithm 
A is at least ^3^y^(^)"^~-'^*^(^)'''''^- By summing up this probability over 
set S of all solutions of F and by lemma 2, the probability that algorithm A 
outputs some solution is at least Exes n-Js{x)/k+i (|)^--^^(^)/^(^)-^^W/^ > 

1 (l\n-n/k(x^ ( l\n-Js(X)\l/k( 1 \J.c;(X)/k ^ 1 (l\n-n/k 1 ( 1 \n/k 
■ \l^xeS\d) I \d-l' — n+l\d' ' ^ ' \d-l' 

;^((iy^)-". So by repeating the while loop n(n+l)(dy^)" = 0*{{d^[^Y) 
times, we can find a satisfying assignment with probability approaching 1. 

When d = n", this upper bound becomes O* (n""*-^" fcn^Vn )) . 

5 Conclusion and Future Work 

We have generahzed two algorithms from /c-SAT to A;-CSP, with running time 
better than the trivial bound 0*((i") when variable domain size d is fixed. 
When d is unfixed, say d — rf, the result is only shghtly better than the trivial 
bound 0*(n""), whether we can reach 0*(n'^") (where /3 < a is a constant) 
in this case is still open. Our solutions to the recursion (1) and (2) might find 
other application in the analysis of DPLL-like algorithms. Our randomized 
algorithm is the first application of PPSZ-like strategy beyond SAT to CSP. 
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In summary, this paper can be viewed as the first step toward estabhshing 
upper bounds for solving /c-CSP using DPLL-like or PPSZ-like strategies, 
which leaves mcuh room for further study and improvement, for example, by 
combining PPSZ-like and local search algorithms as in [3]. 
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